Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
który | 8129 | 442 | 1 | 442.0000 |
która | 4232 | 239 | 1 | 239.0000 |
ale | 12128 | 326 | 3 | 108.6667 |
które | 7317 | 355 | 4 | 88.7500 |
Na | 6383 | 344 | 4 | 86.0000 |
Po | 3896 | 237 | 3 | 79.0000 |
Według | 1035 | 79 | 1 | 79.0000 |
Do | 2770 | 150 | 2 | 75.0000 |
że | 35777 | 1108 | 22 | 50.3636 |
Jego | 1106 | 48 | 1 | 48.0000 |
W | 20027 | 841 | 18 | 46.7222 |
Jednak | 1048 | 45 | 1 | 45.0000 |
bo | 5734 | 219 | 5 | 43.8000 |
Ale | 3072 | 126 | 3 | 42.0000 |
Od | 1769 | 126 | 3 | 42.0000 |
Nie | 6965 | 289 | 7 | 41.2857 |
Gdy | 1023 | 37 | 1 | 37.0000 |
którzy | 3602 | 201 | 6 | 33.5000 |
Są | 579 | 33 | 1 | 33.0000 |
Prezydent | 452 | 33 | 1 | 33.0000 |
Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
l. | 764 | 2 | 64 | 0.0313 |
odbędzie | 569 | 1 | 30 | 0.0333 |
r | 261 | 1 | 20 | 0.0500 |
pojawiły | 299 | 1 | 16 | 0.0625 |
pojawił | 319 | 1 | 15 | 0.0667 |
okazało | 452 | 2 | 28 | 0.0714 |
fakt | 404 | 2 | 28 | 0.0714 |
wraz | 690 | 2 | 24 | 0.0833 |
nadzieję | 541 | 2 | 24 | 0.0833 |
dostęp | 229 | 1 | 12 | 0.0833 |
mistrzostw | 643 | 5 | 54 | 0.0926 |
związanych | 294 | 2 | 21 | 0.0952 |
2021r | 380 | 1 | 10 | 0.1000 |
dodatkowo | 226 | 1 | 10 | 0.1000 |
Stanów | 204 | 1 | 10 | 0.1000 |
odbyło | 167 | 1 | 10 | 0.1000 |
doprowadzić | 144 | 1 | 10 | 0.1000 |
wystąpienia | 92 | 1 | 10 | 0.1000 |
2019 | 603 | 5 | 48 | 0.1042 |
pojawiają | 199 | 1 | 9 | 0.1111 |
In this subsection, we compute the ratio of the number of right neighbors and the number of left neighbors. Again, we look for words with extreme ratios:
Data for first table:
select word,w.freq,aa.cnt, bb.cnt,aa.cnt/bb.cnt as r from words w, (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where w_id=aa.w1_id and aa.w1_id=bb.w2_id order by r desc limit 20;
Diagram data:
select aa.cnt, bb.cnt from (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where aa.w1_id=bb.w2_id;
5.1.7.1 Number of NN co-occurrences vs. Frequency I
5.1.7.2 Number of NN co-occurrences vs. Frequency II